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(57)Abstract 

PURPOSE: To obtain a new nitrile hydratase gene having a specific 
base sequence, coding for nitrile hydratase stemmed from 
Rhodococcus bacteria, and useful for e.g. producing an enzyme 
capable of giving amides or carboxylic acids useful as raw materials 
for medicines, liquid crystal material, starting from nitriles. 
CONSTITUTION: The new gene has a sequence of the formula and 
codes for a nitride hydratase stemmed from Rhodococcus bacteria 
[e.g. Rhodococcus rhodochrous IF0 15564 strain], therefore being 
useful for e.g. producing through the genetic engineering technology 
a nitrile hydratase capable of giving amides or carboxylic acids 
useful, as raw materials for medicines, liquid crystal material, etc., 
starting fi^om nitrile compounds. This enzyme gene is obtained by 
the following process: the above strain is cultured and the 
chromosome DNA therein is extracted, and using the DNA, a 
chromosome library is constructed, and then screened using a nitrile 
hydratase gene fragment as a probe, followed by recovering the 
aimed DNA from a positive strain. 
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C57) [S*5J] 



(2) 

1 

D— Ki'-7X ( Rhodococcus r h o d o c 
h r ou s ) I F O 1 5 5 6 Ai^-CiyZn^TS. 1 IBIS© 

P— Ki'^^ ( Rhodococcus r h o d o 
oh r o u s ) I F O 1 5 5 6 4»-CASI»*^2iBt8 

[000 n 

[0 0 0 2] 

[SeJfeOCfJB] y^l't h-^i»— tffeJiOfrSiSC-Hz 20 

<«F4iBB5 6- 1 7 9 18#42r*g. i1$^Bg5 9-37 9 

5 1 ^^|g. !1#<&BS6 1 - 1 6 2 1 9 3^<2rfB> WiiVS 30 

6 1-2151 9-^4i«. It^age 4-8688 9-^4:^ 

«F^^^4 - 1 9 7 1 8 9#4i«. EP044464 
0 . 

[000 3] SJ/c. cn^<Dfilldfeia*>6ti- h U^bt K 

- h y;l'b f — tf3iei^{c-3t,>-C«. Mjttf P- K 
3 *X^*Bl|ft*©jie^*l!|#gg¥2 - 1 1 9 7 7 8 
-^^^■^EPO 44 5 64 6«:*5l,>r. i^a- K*:^;^ 40 
S*fflSS*CDiH£^*s#iB^3 - 2 5 1 1 8 

tei»-c. >; yi^ AS«iSi**©ae^^:i^«FM¥6 - 2 

5 2 9 6-^^2iffi-«»!f$^jp6 - 3 0 3 9 7 1 ^<Z^K:tet,» 

^ 'J •> AattHi P - F 3 V * ;^BHBlf 

*JEPO 433 1 1 7«Cg3iK3n-Cl»S, Se^KP-F 
3 V ^XHifflffii*©- F y^l/t F^ ^-Hzite^fc J; 

im^lfi^m^b- 8 8 5 5 6^^-CK^Stirt> 

^. so 



i^ga¥9-997 3 

2 

[0004] J5^, C ©J: -5 tSi^^mifii>':>z. V V Mt 

^©^^S6*tc.ffl-rsK^?f*i<£$nrt,>s. ^io^m 

«fii©ig(,»^^ri14^t^©iiji«:fM-r-5./ci&k:. Si 

2 - 8 4 1 9 8 ^^iC^m^xSmtJ: a -gfeW« 
K©»jS«:ffl(,i^.SS[^!^{cot,>-c. !^g3¥4-34 1 1 
8 5^4:j«fC»5fe^tgtt%2 - t FP + i'^^UsK^K© 
^ig^CfflC^^a^igj^COli-r, E PO 4 3 3 1 1 7K.it 
^^imHir h T'P^ i>©i!{jgtCffl(,»Sffl[^«JK:-3(,> 

r^n-enpa^T^snrc^s. Mayaux 

e.»S ( + ) -2-7x-;U7'Of*>®©SjS(Cfflli 

sp- F3f *;^ii*fflS©r 5 i<'--fe'ae^-©DNAie 

5>l©?8^**JC«:-?Tl>S (Mayaux,J-F. etal..}.Bac 
teriol . .173.6694. (1991)) 

[000 5] C©J:^fegS*!g)©^^, P-F3y*;^ 
P— F^'?;^ ( Rhodococcus r h o d o 
ch r o u s ) I FO 1 5 5 6 4tt{*il'?/j:^-fe5^4> 

iB5©SJfg©ft:»©fiJffl©J£;'c*i^i®3n-S (Kakeya.H. 

Gt a1 . .Tetrahedoron Lett. ,32,1343,(1991)). 

tf. P- F35,3!7;:;p-Fi'5X I FOl 5 564«j© 

r^'^^l/slOKtC^^SSlB (Kakeva.H..et al.,Che 

m.Lett. ,1823,(1991))*iJ:C/:7'a + 7;Ui&i^g^vp ^ 

- F y Ji'^&?iSimjgai©Jl*4«:Wffl^c*^?SttT 5 F* 
;l'jK>K(C^S"i"SSlD (Yokovaina,M. ,et al.,Tetrahe 
dron lAsynmetry , 4 , 1080 , (1993) )^*^ $> <aS . 

©toc©J:^/3:«ftt<&I«6t|K:^iJ«-r^/cJ6«:tt. JS 

i£*ft8irsaf«©<fc»)-H©*sttr^±, ji!i-¥>was 

SttJ&ri'd^MStvS. 0*^1/. *M©- V y;Ut F^3» 
— tf-?»r 5 d^-4f*Sl>»PI«©i|$tt«:^-rflfe©*®l* 

3fc©»^{cot,>-c«, ^©^3S*sp6*^{c3nri,>^c 

iia^K©w^*i«^ur(,>sjife^i 
^^-f d»>^<d'ax^e«i*^4ffl(,»/c»^©jw^w 

)5c^ll-<*5Jc^«: J: «.?ScS**5 C !Q: ^ C i *sr # <l*>r> 
[0006] 

mmi)miSko^'>friwm] *i6w©awttp- f 

;ftp- Fn -^^^^MiffflSi*©- f- y>'l't F^^— fe'S 

e^^. T 5 i^—fe'jte^Ptej:c;*n6©DN Affile 

[SS5r^^TS/c*©#K] T^^it. Tie ( I ) ~ 
(4) ©«fiK«:*^S. 



3 

( 1 ) mmco^n^n 1 -c^^n^ -J tfxm 

(3) P-K:3^^:^;^M*fflM*^D-K:3-y:^>^ a- K 
i'-^-X ( Rhodococcus rhodochro 
us ) I F O 1 5 5 6 4tt'C^'5t5fBS 1 ^iSiS<Dfie 

( Rhodococcus rhodochro 
us ) I F O 1 5 5 6 4»-C*5BrrtBII2SgBiS<Dite 

[00071 «r. *»9a*i¥«B«:K?8reo n- Kn 
t:;:^:^ a — -Ki7^>< ( Rhodococcus r h 
odochrous ) I F O 1 5 5 6 4»SS&itDN A 
\t. t«l^tfSa i t oh(0:fj& (Saito.H.A Miura,K.,B 
lochim.Biophy.Acta. .72,619,(1963)) ^m^^^X^M't 

^^-^X EMBL3 >'T-t?-^^^-^^^ffil^rfP 

5:5^— fe:ae^cr)^U-^>^«. 04x.t^S a i k i 6 
<DPo lymerase Chain Reactio 
n (PGR) a (Saiki.R.K. et al Science, 230, 13 50 
(1985)) ^ffil>r*5C^j:^Ci;0inr^« C(DB#. Il^ffl 
•r-SPCRfflT/^-^v-K, y^i/fc K^t>— fe' 

K:3 y:^xM'C^*R,. s p . ^ (Mayaux.J 
-F. et a1 . . 3 . Bacten ol . . 173 . 6694 . (1991) ) . R^. _s_ 
jp. N - 7 7 4ftc (Hashimoto,Y. et al. .Biochim.Biop 
hys. Acta. ,1088.225.(1991)) . R,. erythrop 
o 1 i s J CM6 8 2 3*fe (Durban, R. et al.,Biosci. 
Biotech. Biochem., 57, 1323, (1993)) ^^i^^tR. r h 
odochrous J 1 (Kobayashi ,M. et al.,Eur.J. 
Biochem. ,217,327, (1993) )*feJ^ <5:tC^Cirie^j3^e86*^ 
(CStirt^-So ^ft. R. s p^. R. s^. N-7 7 
4^*^cni^. erythropolis J CM6 8 
2 3 i'(D^«f 6. cn^cDT^^- -fe'iae^TSS 

[000 8] ^ur PCRffirf#/c^ h y;bt k^^?- 

tLX^m'f^CtlfCj:^ ^ Rhodococcus 
rhodochrous (R,. rhodochrou 
_s) I FOl 5564»<3[)SSfe(*DNA^>f^^y-*^ 
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4 

NA^1f:/^P'->'ft'r^c<bK:J:or. ^mJnitt^ 

m<ODNA^l{t. S an g e r 6{Cj:^d i d e o X 
y& (Sanqer.F. et al . ,Proc. Natl .Acad, Sci .U.S.A. ,7 
4 , 5463 , (1977) ) tj: ^^ftJCD^ffi^flB C > T -5 C <t 

10 fcmmimdi&mrA(om^j:mmfi^^Ji^xMm 

JMl 09 (pCNH-SE) FERM P-14 
90 1^ 
[0 00 9] 

immm] armmm«-x:ti^m^mwmcfm'r^. 

^feftDNAOiaS 

R,. rhodochrous I F O 1 5 5 6 4*^T^ 

(ommxs o'C(,cx 2 Bm^mofc^ 
mn^ X i. s% 

20 U U A 0. 0 5 9^ 

U>»**— jitjU^A 0. 0 5% 

Watr^:^i^^A 0. 0 5% 

<-XhX:^Xh^ 0. 19e 

B-ti':/a^^^M, 0. 5% 
[0010] e«ftOSft^*®U. tK^^U/cO . 1 M 

<Dij}vty^Amm^ c (omi^^mm u 3 o ^m»^^(ictk 

it* 1 0 m 1 ommm ( 5 0 mM F »J X-iS^S5St 
(pH8. 0) . SOmMEDTA) icmMOfc. CCD 

/eg. $6tCl mg<OTd'n^-^:7*^3^-*fe:*J)a:13 0 
^ra^atcScSU/c. CCDSSSRCCO. 5%SDS^?at 
T. 1 0 0 ag/m 1 (DaSi/jr^^-^CCy^nT^rf— fe* 
K (^^Ui'ttS!) -m5 5'CX^h^i)^(cmm 

c ^ J: r> r^fe* D N A *i^m byt, 
[00 11] sk&t^^-i^^v-oyrjpm 

f#e>ti/c^fettDN A 2 0 u g«:*f LMKBW^S a u 3 
Ai:mi^^xm^mit^i6Crj:^fc. m^. ^feftONA 
40 *4 M g:':5o5*<7)^^-:/tc<t 1 0 0 /x I 

Sa«t?, $IM8mSau3Al (04^S^-->. 2-8 
U/Ml)4TiBO^fl^"CaS»I0 3 7-C^SJCS#fca. 
JJiSa 2 0 mM i /j: 4 <fc 5 E D T A *»n;lSj£*f?± ^ 

Sau3AIISftlS O. iMl Rj6mm2» 



0. 5/A 1 2^)^ 
1/i 1 1» 

1/^1 7» 



50 CCD<t^CCLriHSL/c^fe»DNA(DgiJ»?mb»r>{-«r 



C4) 



-997 3 



x^y-;ua:]^(Cct0^fil5]iRL//cg. CCD^fef*DN 
^fflC^rae. OOOrpm. . l 8B$r^. 15*CCD^ 

fl^ri 0 4 0 %w/w<D t/ a ttSa*JEM'ilsfficc J: o 
Mg^b/Co 1 0-2 0 kb(DDNA»T>T-^#tf^iii*x 

^^-;uit-sccrip|iRb. i o m i <dt Emm(^cmm^ 

i±fc. CCD^<D9 M 1 <t 1 A/ fir<DBamH imftA E 
MBL3"<i^i$?- (X h5^>^->. AEMBL3/B 
amHI VectorKit) i*3 5 0U<DT4D 
NA y fe' (^SjgttSJ) *fflc>r 1 5 /i 1 (Dit-C-^ 

-f^-S^nXy/Co CCr)JgJC&?S4 M 1 CCOC>rG i g a 
packllGo 1 d C^^ h ^^>^->«:Si) ^fflC^r i 
n vi t r o^^•^^'5r— *x>y3g:feC3fe-oA:o C<D/^*•y 

^-i»^mm(Dnm^m^fci^(>c. :w^aP2 39 2 

^fe^^l^r^S?-^^- (tit r e) 9J^^*5C^j:o/c. 
^(Omm. COAEMBL3Slfei*DNA^-r:/^i;-5|c 

R. rhodochrousIFQ1556 4^fe^DNA 



coo 1 2] r.h>;;bfc: K^iS?-4^jS&T*>J:i>'T5^ 

gKOCDR,. AP_- N-7 7 4». R. e r y t h r o p 
s JCM6 8 2 3*fc*><tafR. 



10 oli 



rhodochr 



ou s J 5 ^--fe'iie^^irbSJ^i^b. 01 CD 

r ^ -/KCcmt--^^ D N Ai®ij5&>6^cC^ P C RfflS^:/ 

[0013] 
[01 ] 

[0014] PCRftf3:IUTCDSJ£^ff="C:|»C3^j:o/Co 



AMN- 1 

AMR- 1 :/^>fv- 

dNTP^vg 

1 0 xStS^'^'tyy T'- 



DNA^^ 



9 4-C. 

6 O'C. 

7 2'C. 
4011 



4 5» 
60# 
6 0# 



CCDJ:^«:orff/ciiiSDNA»rK-cD^-^. ^S^ti-^ 
*^0. 7 kb<DBrK**0iRl/v T4DNA^i<';-;'^— fe' 
CC J: O^^f^^fci T 4 y 5? ^ U:t ^ K+:^— «^CC<fc 

0 •J>®^fc4iiS^:feCi^cC-:>/Co CCDDNASrK-^pUC 

1 1 8(DSma lSPffiiaBL//Co C(Dffl»^f*:?'^X 
5 FcD^SADNABr^T-tCoi^r. d i d e o xyffiCCJ: 
ODNAIB?)J^^b. f^iR$n/cr^yKi^J)&^K*D 

^W:f'y:^%Y^%\^. Sy^(D:7-^>r'7-AMN- l. 
AMR- l^ffll^rSJ^-pScD^fftC^fc-SPCR^^sC 
J^jrir^ ^cD^A^tl/ci^O. 7 kbDNA»f>tOif$i^ 
^^tSi-yic. CCDDNAK>i-?rBcaBEST^-<y> 

^fcC/cCofCo C<DSSiDNA»r>T-*;^^'y-^>yffl 

[0 0 15] ^HUJUtK^^— tfafe^feiO^T^^ 

lutB-^ ^ -fe- h«<Dift 1 0 ' p f u^<D:7 T -i^SSf^ 
<b. 2 8-C-C-%^SLO. D. ,00 = 0. hLtl^^ 
^K: 1 0mM<DMgSO,r^U/cP2 3 9 2*2 0 



30 



40 



50 



1 

1 0 0 pmo 1 

1 0 0 pmo 1 
SI mM 

1 0 M 1 

2. 5Un i t 
ftl 0 0 M 1 

Om 1 i^S^U/Co CCD^^3 7-a X^^fAAly 
+ *-^-hL/ct*. 0. 7%r;«7P-X*3ml»P;i 
/Co 1. 5%r;0'u-;^LBSife (I 0%>'^'^ F M;^!^ 

3) . 5%NaC 1 ) <D:/U— F (9 cmiS®) ^ 
U-h^/cf] 1 OOOO-'l 5 OOOy^-^^J^-SJ: 
-^tcccD^S^^^. r^^-^cD-et/cy^'U-h^ffSf 
UAio C<Dr/u-hCc»L. Hujeuyt ["P3«ai:/o 
--:/*fflC^TManiatis^CDMolecu1ar Ooninq (Cold S 
pring Harbor Laboratory. N.Y. (1982) )feig<D:6'ffiCCae 

/Co ^CD^m, W^^Vy-'J>':f^-^\QOQm^tL 

0 l-'2fflcDrai4i^^:^;i/^t&/Co m^l^^'r)\^^^t 

Sj^fi^W:4ffl(Diffl^^:7T-$;^^*«^tL/. Maniatis6<D 
htolecular Cloning (Gold Spring Harbor Laboratory, 
N. Y. OBS2^WM<Dl5WiCU oT^r-i^DNA 5:1^ 
L/Co CCDiai!{U/c:7r-^i>DNA*a^CD*fMffl«* 
jDArrB-fbU. lMf>:/n ^f*^ >:5^^ (southern. 
E.M..3.M01. Biol.. 98, 503(1975)) ^fcC&o/to ^0 

SSI:/P-:^CCJ:or«ia3nyfciK9kb<DEco 
R nWtDNA»r>t*W-r:7T->^i5'n->(coiiT. 
3 6«:i¥ffl/j:$iMSacci:S»«f€r*jcy5:o/Co -ecDS 
*C<D7y-S^^o->ic«. iKil 5 kb<3!)DNABr>^■ 
>^)5^fA^nrl>^C<fc*^«?3L/co ^cD$iMS^fl&0* 
a2CC7nl//Co $6{C. CCD»ASn/c*5)l 5kb<DD 



(5) 



#ia¥9-9 97 3 
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* [0018] 

coo 1 9] 
1 

^JCOS^ : 1 5 6 6 
lB5U<0«ffl : G e n o m i c D N A 



4Jg5« :P-Kr7y:^x o-K^^X(Rhodoc 
occus rhodochrous) 
: I FOl 5564 



: 1 . . 15 6 6 
mk^^MLtc:^m : E 



NA»f>ttC-:)C^rit'-tf>»*f«rfeC:^d:C\ -ecDJSm. E 
c oR I - S a c im^me k b DNAmt^i^^y'o-:^ 
icj:ortfem$ti/c(or. ccddnak^t-^pUCI 9 

<DSacI-EcoRl SSiSCCiiAOfco f#e>n/cffljfe 
K*p C NH - S E <b L/to 

[0016] 
[02] 

[0017] pCNH-SEcDDNAK^iJ(7)i^^ 
pCNH-SE<DDNAiS3^J(7)9^^^Bc a BESTt^ 
-^5^x>i/>y+^:. h (SaSttS) 0:i?ffiCCS£":>r*5 10 

2 1 r ^ Kf^^-^v*;--?"^ >^yiy- 

A7j)5r 5 fe'ae^-^:^-- FT'E>DNAiE?'Jr^^C 

2 0 7T^^K*3- ¥'r^^-y>')-f^>^ 

:7U-A3&«^h yjUfc F'^-:S'-^2f(Z>(x1f::^JLjL5f hit 
K-r€»DNAi25^jT^^C<h. 2 12ra>' 

NAga^J-C^^Ctditos&^o/Co 56iC. ^hUJUbF 20 

--y^'i>^y ACDTSS^CH ashimoto^(Ha 
shimotD.Y.et al .,Biosci .Biotech. Biochem. ,58,1859, 
(l994))tcJ:or*$nj^^ h »;;l/b F*^^— fe'OrStt 
^jcBa-^T-SH-T-^^i- F-r-SSg 9r 5 >^K<fcD>^cC 

1 ATG GCG ACA ATC CGA GCT G/VC GAC AAT CiCA ATA GAC ACC GCC GCA 
Met Ala Thr lie Arg Pro Asp Asp Asn Ala lie Asp Thr Ala Ala 

10 

46 AAG CAT TAG GGC ATC ACT CTC GAC CAA TCA OCC COG CTC GAG TGG 
Lys his Tyr Gly He Thr Leu Asp Gin Ser Ala Arq Leu Clu Trp 
20 30 
91 GCG GCA CTG ATC GAC OGA GCA CTG GGC TCC TAC GAC GTC CTC GAC 
Pro Ala Leu He Asp Gly Ala Leu Gly Ser Tyr Asp Val Val Asp 

40 

136 GAG rrc TAC GCC GAC GAC GCA ACC CCG CCG ACC ACG TCA CGT GAG 
Cln Leu Tyr Ala Asp Glu Ala Thr Pro Pro Thr Thr Ser Arq Glu 
50 60 
181 CAC AGG CTC CCA ACA C3CC AOC GAA AAT CCnrC AGC GCT TOG TAT 
His Thr Val Pro Thr Ala Ser Glu Asn Pro Leu Ser Ala Trp Tyr 

70 

226 GTG AOC ACA AGC ATC QiQ CCG ACQ TCG GAC GCXI CTC CTG ACC GGC 
Val Thr Thr Ser He Pro Pro Thr Ser Asp Cly Val Leu Thr Gly 
80 90 
271 GGA COC CTG GCC ATC AAC OCA AAC GTG ACC CTG GGC GGA CFT CCG 
Arg Arg Val Ala He Lys Asp Asn Val Thr Val Ala Gly Val Pro 

100 

316 ATC ATC AAC GOG TCr 000 ACA CTA GAC GOG TTC ACT CCG TCT COC 
Mec Met Asn Gly Ser Arg Thr Val Clu GTy Phe Thr Pro Ser Arg 
HO 120 



45 



90 



135 



180 



225 



270 



315 



360 
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9 10 
361 CAC GCC ACT CTG ATC ACT CGV CPA CTC GCC QIC GCT CCA ACC CTC 405 
Asp Ala Thr Val lie Ihr Art) Leu Leu Ala Ala Gly Ala Thr Val 

130 

406 CEG GQC AAA GCT CTG TCr OSC GAC CTG TCT TTC T(X GCT TCG AOC 450 
Ala Gly Lys Ala Val Cys Glu Asp Leu Cys Phe Ser Gly Ser Ser 
140 150 
451 TTC ACA CCG GCA AGC GGA COG CTC CGC AAT (DCA TGG GAC CCA CAG 495 
Phe Thr Pro Ala Ser Gly Pro Val Arq Asn Pro Trp Asp Pro Gin 

160 

496 GCT GAA GCA GCT GGA TCA TCC GCT GGC ACT GCG GCT CTC CTC GCA 540 
Arq Glu Ala Gly Gly Ser Ser Gly Gly Ser Ala Ala Leu Val Ala 
170 180 
541 AAC OCT CAC CTC GAT TTT GOC ATC GGC CGC GAT CAC GCT GGA TCG 585 
Asn CTy Asp Val Asp Phe Ala He Gly Gly Asp Gin Gly Gly Ser 

190 

586 ATC COG ATC CCG OCG GCA TTC TOC GGC CTC CTC GGA CAC AAC CCG 630 
He Arc? He Pro Ala Ala Phe Cys Gly Val Val Gly His Lys Pro 
200 210 
631 ACG TTC GGG CTC CTC CCG TAT ACC GCT GCA TTT CCC ATC GAG CCA 675 
Thr Phe Gly Leu Val Pro Tyr Thr Gly Ala Phe Pro He Glu Arq 

220 

676 ACA ATC GAC CAT CTC GGC COG ATC ACA CGC ACG CTC CAC GAT GCC 720 
. Thr He Asp His Leu CTy Pro He Thr Arq Thr Val His Asp Ala 
230 240 
721 OCA CTG ATC CTC TCG CTC ATC GCC GCT CGC GAC GCT AAC GAC CCA 765 
Ala Leu Met Leu Ser Val He Ala CTy Arq Asp CTy Asn Asp Pro 

250 

766 GGC CAA GCC GAC AGC CTC GAA CCA GCT GAC TAT CTC TCC ACC CTC 810 
Arq CTn Ala Asp Ser Val CTu Ala Gly Asp Tyr Leu Ser Thr Leu 
260 270 
811 GAC TCC GAT CTG GAT GCT CTG CGA ATC GGG ATC CTT CGA GAA GCT 855 
Asp Ser Asp Val Asp CTy Leu Arq He Gly lie Val Arq Glu Gly 

280 

856 TTC GOG CAC GCG CTC TCA CAC CCC GAG CTC GAC GAC GCA CTC CGC 900 
Fhe CTy His Ala Val Ser Gin Pro CTu Val Asp Asp Ala Val Arq 
290 300 
901 OCA GOC GCA CAC ACT CTG TOC GAA ATC GCT TGC AGG CTA GAC GAA 945 
Ala Ala Ala His Ser Leu Ser Glu He Gly Cys Thr Val Glu Glu 

310 

946 CTA AAC ATC CCG TOG OVC CTA CAT GCT TTC CAC ATC TCG AAC CTG 990 
Val Asn He Pro Trp His Leu His Ala Phe His He Trp Asn Val 
320 330 
991 ATC GOC ACG GAC GCT OCT GOC TAG CAC ATC TTC GAC GGC AAC GGA 1035 
He Ala Thr Asp CTy Gly Ala Tyr Gin Met Leu Asp Gly Asn Gly 

340 

1036 TAC GOC ATC AAC GCC GAA GCT TTC TAC GAT OCG GAA CTG ACG GCA 1080 

Tyr Gly Met Asn Ala C\u Gly Leu Tyr Asp Pro Glu Val Thr Ala 
350 360 
1081 CAC TTT GCT TCT CGA GOC CTT CAG CAC GCC GAC GCT CTG TCC GAA 1125 

His Phe Ala Ser Arq Arq Leu Gin His Ala Asp Ala Leu Ser Glu 
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n 12 

370 

1126 ACC CrC AAA CTC GTG OCT CTG ACC OGC CAC CAC GOC ATC ACC ACC 1170 
Thr Val Lys Leu Val Ala Leu Thr Cly His His Cly lie Thr Thr 
380 390 
1171 CTC CXjC GCX: GCG AOC TAG GOC AAA GCC CCX; AAC CTC GTA CCG CTC 1215 
Leu Cly Cly Ala Ser Tyr Gly Lys Ala Arg Asn Leu Val Pro Leu 

400 

1216 OCC COC GCC GCC TAG G^C ACT GCC TTG AGA CAA TTC GAG CTC CTG 1260 
' Ala Arg Ala Ala Tyr Asp Thr Ala Leu Arg Cln Phe Asp Val Leu 
410 420 
1261 CTG ATG CCC ACA CTG CTC TAG CTC GCA TCC GAA CTA CCG GCC AAC 1305 
Val Met Pro Thr Leu Pro Tyr Val Ala Ser Clu Leu Pro Ala Lys 

430 

1306 OVC CrC GAT CGT OCA ACC TTC ATC ACC AAC Oa CTC GOG ATG ATC 1350 
Asp Val Asp Arg Ala Thr Phe He Thr Lys Ala Leu Gly Met He 
440 450 
1351 OCC AAC ACA GCA CCC TTC CAC CTG ACC GGA CAT COC TCC CTC TCC 1395 
Ala Asn Thr Ala Pro Phe Asp Val Thr Gly His Pro Ser Leu Ser 

460 

1396 CTT CCC GCC CCC CTG CTC AAC GCC CTT CCG CTC GGA ATG ATG ATC 1440 
Val Pro Ala Cly Leu Val Asn Gly Leu Pro Val Gly Met Met He 
470 480 
1441 ACC GOC AAC ACC TTC G^ GAT GCG ACC CTC CTC COG GTC GOG CGC 1485 
Thr Gly Lys Thr Phe Asp Asp Ala Thr Val Leu Arg Val Gly Arg 

490 

1486 OCA TTC GAA AAC CTT OGC GOC GCG TTT CCG ACC CCT GCC GAG CAC 1530 
Ala Phe Clu Lys Leu Arg Gly Ala Phe Pro Ihr Pro Ala Asp His 
500 510 
1531 ATC TCC GAC TCT GQ\ GCA CAA CTC ACC CTC ACC TAG 1566 
lie Ser Asp Ser Ala Pro Gin Leu Ser Leu Thr 

520 
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1 ATC TCA CTA ACC ATC GftC CAC ACA ACC GAG AAC GOC GCA CCC GCC 45 
Met Ser Val Thr He Asp His Thr Thr Clu Asn Ala Ala Pro Ala 

10 

46 OVC GOC CCC CTC TCC GAT CGC GCG TOG OCC CTG TTC CGC OCA CTC 90 
Gin Gly Pro Val Ser Asp Arg Ala Trp Ala Leu Phe Arg Ala Leu 
20 30 
91 GVC OCT AAC GGA TTG CTA COC GAC OCT TAG CTC GAG GGA TOG AAC 135 
Asp Cly Lys Gly Leu Val Pro Asp Cly Tyr Val Clu Cly Trp Lys 

40 
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13 14 
136 MG ACC TTC GAG GAG G^C TTC ACT CCA AGG CGC GQ\ GCG GAA TTG 180 
Lys Thr Ftie Clu Glu Asp Phe Ser Pro Arg Arg Cly Ala Glu Leu 
50 60 
181 CrC GOG CGG OCT TGG ACC GAC CCC GAT TTC CGG CAA CTG CTT CTC 225 
Val Ala Arq Ala Trp Thr Asp Pro Asp Phe Arg Gin Leu Leu Leu 

70 

226 ACC GAC OCT ACC GCC GCC err GCC GAG TAC OCA TAT OT OGC CCC 270 
Ihr Asp Cly Thr Ala Ala Val Ala Gin Tyr Cly Tyr Leu Gly Pro 
80 90 
271 CAG OGC GAA TAC ATC CrC OCA CTC GAA GAC ACC COG ACC CTC AAC 315 
Gin Cly Clu Tyr He Val Ala Val Glu Asp Thr Pro Thr Leu Lys 

100 

316 AAC ore ATC CTG TOG TCC CTC TCT TCA TOC ACC GOG TOG CCC ATT 360 
Asn Val He Val Cys Ser Leu Cys Ser Cys Thr Ala Trp Pro lie 
HO 120 
361 CTC OGC CTC CCC CCT ACC TOC TAC AAG ACT TTC CAA TAC COT GCC 405 
Leu Cly Leu Pro Pro Thr Trp Tyr Lys Ser Phe Glu Tyr Arg Ala 

130 

406 OGA ore CTC COT GAG CCA COG AAC CTT CTC TCC GAC ATC CCA ACC 450 
Arg Val Val Arg Clu Pro Arg Lys Val Leu Ser Glu Met Gly Thr 
140 150 
451 GAG ATC GCG TCG GAC CTC GAC ATC CGC CTC TAC GAC ACC ACC GCC 495 
Clu He Ala Ser Asp Val Clu He Arg Val Tyr Asp Thr Thr Ala 

160 

496 CAA ACT CGC TAC ATC CTT CTC CCC CAA CCT GCC CCA GCC ACC GAA 540 
CTu Thr Arg Tyr Met Val Leu Pro Gin Arg Pro Ala CTy Thr Glu 
170 180 
541 OGC TOG AGC GAG GAA CAC or CAA GAG ATC CTC ACC AAG GAC TCC 585 
CTy Trp Ser CTn Glu CTn Leu Gin Clu He Val Thr Lys Asp Cys 

190 

586 CTC ATC GCC CTC CCA CTC CGG CAG CTC CCC ACC CTC TGA TCACCCCC 632 
Leu He Cly Val Ala V^l Pro Gin Val Pro Thr Val *** 
200 



633 AC AACAAAGAAC CACACC ATC GAT GGA OTA CAC GAT CTT OGC OGA CHT 680 
Met Asp CTy Val His Asp Leu Ala Gly Val 

10 

681 CAA CGC TTC OGC AAA CTC CGC CAT ACC CTC AAC OGC GAC ATC OGC 725 
Gin CTy Phe Cly Lys Val Pro His Thr Val Asn Ala Asp He Gly 

20 

726 GCC AGC TTC CAC GCC GAG TOG GAA CAC CTG GCG TAC AGC CTG ATC 770 
Pro Thr Phe His Ala CTu Trp Glu His Leu Pro Tyr Ser Leu Met 
30 40 
771 TTC OGC OCT CTC GCC GAA CTC GGG OCA TTC AX CTC GAC GAA CTT 815 
Phe Ala CTy Val Ala CTu Leu Cly Ala Phe Ser Val Asp Clu Val 

50 

816 GGA TAC CTC CTC GAC GOG ATC GAA CCA CGC CAC TAC ATC ATC ACC 860 
Arg Tyr Val Val CTu Arg Met Clu Pro Arg His Tyr Met Met Thr 
60 70 
861 OCC T/C TAC GAG AGO TAC CTC ATC OGC CTC GCC AGG CTG ATC CTC 905 



(9) 

.15 16 
Pro Tyr Tyr Clu Arq Tyr Val He Cly Val Ala Thr Leu Met Val 

80 

906 AAG OCA ATC CTG /CG CAG GAA GAA CTC G\A AOC CTT CCA CGG 
du Lys Cly lie Leu Thr Gin Clu Clu Leu Clu Ser Leu Ala Cly 
90 100 
951 GGA CCC TTC CCA CTG TCG COG CCC AGC GAA TCC GAA GGG CCT CCG 
Cly Pro Phe Pro Leu Ser Arq Pro Ser Clu Ser Clu Cly Arp Pro 

no 

996 CCA CGC crc GAG ACC ACC AOC TTC GAA ATC GGT OV:; CCT CTA CCC 
Ala Pro Val Clu Thr Thr Thr Phe Clu He Cly Gin Arq Val Arq 
120 130 
1041 crc COC GAG GAG TAG GTT CCE GOG CAT ATT C3GA ATG CCT CCG TAC 
Val Arq Asp Clu Tyr Val Pro Cly His He Arq Met Pro Ala Tyr 

140 

1086 TGC COC GGA CCA crc GGA AOC ATC TCT CAT OXI ACT ACC GAG AAG 
Cys Arq Cly Arq Val Cly Thr He Ser His Arq Thr Ser Clu Lys 
150 160 
1131 TCG COG TTT CCC GAC OCA ATT GGC CAC GGG CCC A^ GAG CCC CGC 
Trp Pro Phe Pro Asp Ala He Cly His Cly Arq Asn Asp Ala Cly 

170 

XL76 GAA GAA CCC ACC TAC CAC CTG AAC TTC CCC OCC GAG GAA TTG TTC 
Clu Clu Pro Thr Tyr His Val Lys Phe Ala Ala Glu Clu Leu Phe 
ISO 190 
1221 OCT AOC CAC ACC CAC OGC COC AGC CTC CTA CfC CAC CTC TTC GAG 
Cly Ser Asp Thr Asp Cly Cly Ser Val Val Val Asp Leu Phe Glu 

200 

1266 OCT TAC CTC GAG CCT GCG GOC TGA 
Cly Tyr Leu Clu Pro Ala Ala 
210 
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111 /OCTACTTCCXXACXTrCT CACCACCTCG AAAAOCTCTA CCAACAAOX; CCTTCC 166 
167 ATTC CATCGACCQVTTCTGCrCGC TGAATCACGC CCTOGCCCXX: TCTACCCOCC T 221 
222 TCTCrCTGA QCGCGCjCn'AAOCCGAACTTA ACGACTCAAT ATCTCGATAC CTATTGA 277 
278 OQC AATTATCJGAT GCGGCCJCrACTCTGAAACAC AAGTGVAOCC GATCACATCA CXi 332 
333 ACjCACACT TCTC ATC CXIG ACA ATC CCA CCT GAC CAC AAT OCA ATA 377 
Met Ala Thr lie Arq Pro Asp Asp Asn Ala lie 

10 

378 CAC ACC CCC CCA AAG CAT TAC CCC ATC ACT CTC GAC CAA TCA GCC 422 
Asp Thr Ala Ala Lys ttis Tyr Gly He Thr Leu Asp Gin Ser Ala 

20 

423 GGG CTC GAG TGG CCG OCA CFG ATC GAC GGA OCA CTG GGC TCC TAC 467 
Arg Leu Glu Trp Pro Ala Leu lie Asp Gly Ala Leu Gly Ser Tyr 
30 40 
468 GAC CrC CTC GAC CAG TTG TAC GCC GAC GAG OCA A(X CCG CCG ACC 512 
Asp Val Val Asp Gin Leu Tyr Ala Asp Glu Ala Thr Pro Pro Thr 

50 

513 ACC TCA CCT GAC CAC ACC CTG CCA ACA GCC ACC GAA AAT CCT TTG 557 
Thr Ser Arq Glu His Thr Val Pro Thr Ala Ser Glu Asn Pro Leu 
60 " 70 

558 ACC CCT TGG TAT CTG ACC ACA AGC ATC CCG GCC ACC TCC GAC GCC 602 
Ser Ala Trp Tyr Val Thr Thr Ser lie Pro Pro Thr Ser Asp Gly 

80 

603 CTC CTG ACC GGC CGA OX CTG GCC ATC AAG GAC AAC CTG ACC CTG 647 
Val Leu Thr Gly Arq Arq Val Ala He Lys Asp Asn Val Thr Val 
90 100 
648 GCC GGA err CCC ATG ATC AAC GGG TCT COG ACA CTA GAG GGG TTC 692 
Ala Gly Val Pro Met Met Asn Gly Ser Arq Thr Val Glu Gly Phe 

OJLO 

693 ACT COG TCT CCC GAC GCC ACT CTG ATC ACT GGA CTA CTG CCG GCC 737 
Thr Pro Ser Arq Asp Ala Thr Val He Thr Arq Leu Leu Ala Ala 
120 130 
738 OCT OCA AOC CTC GCC OX AAA OCT CTG TCT GAC OSC CTC TCT TTC 782 
Gly Ala Thr Val Ala Gly Lys Ala Val Cys Clu Asp Leu Cys Phe 

140 

783 TCC GCT TCC AGC TTC ACA CCG OCA AGC GGA OCG CTC CGC AAT CCA 827 
Ser Gly Ser Ser Phe Thr Pro Ala Ser Gly Pro Val Arq Asn Pro 
150 160 
828 TGG GACCCACAGCCTONAOCAGCTOGATCATCCGCTGOCACTOOG 872 
Trp Asp Pro Gin Arq Clu Ala Gly Gly Ser Ser Gly Gly Ser Ala 

170 

873 OCT CTC CTC CCA AAC GCT GAC CTC GAT TTT OCC ATC GGC GGG GAT 917 
Ala Leu Val Ala Asn Gly Asp Val Asp Phe Ala He Gly Gly Asp 
180 190 
918 CAC GCT OCA TOG ATC OSG ATC CCG GCC GCA TTC TOC GGC CTC CTC 962 
Cln Gly Gly Ser He Arq He Pro Ala Ala Phe Cys Gly Val Val 

200 

963 OGA OC AAC CCG ACC TTC GOG CTC CTC CCC TAT AOC OCT OCA TTT 1007 
Cly His Lys Pro Thr Phe Gly Leu Val Pro Tyr Thr Gly Ala Phe 
210 220 
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19 20 
1008 CCC ATC GAG CGA ACA ATC GAC CAT CTC GCC CCG ATC ACA CGC ACC 1052 
Pro He Clu Arg Thr lie Asp His Leu Gly Pro lie Thr Arg Thr 

230 

1053 GTC OVC GAT GCX: OCA CFG ATG CTC TCG CTC ATC GGC GCT CGC GAC 1097 
Val His Asp Ala Ala Leu Met Leu Ser Val He Ala Gly Arg Asp 
240 250 
1098 OCT A«: GAC CCA CGC OVA GGC GAC ACC GTC 0\A OCA GCT GAC TAT 1142 
Gly Asn Asp Pro Arg Gin Ala Asp Ser Val Clu Ala Gly Asp Tyr 

260 

1143 CTG TCC ACC CTC GAC TCC GAT CTG GAT GOT CTG CGA ATC OOC ATC 1187 
Leu Ser Thr Leu Asp Ser Asp Val Asp Gly Leu Arg He Gly He 
270 280 
1188 err CGA GAA GCT TTC GOG CAC GCG CTC TCA CAG CCC GAC CTC GAC 1232 
Val Aixi Glu Gly Phe Gly His Ala Val Ser Gin Pro Clu Val Asp 

290 

1233 GAC OCA CTC COC OCA GCG OCA CAC ACT CTG TCC GAA ATC OCT TOC 1277 
Asp Ala Val Arg Ala Ala Ala His Ser Leu Ser Glu He Gly Cys 
300 310 
1278 ACC CTA GAG GAA CTA AAC ATC CCG TGG CAC OA CAT GCT TTC CAC 1322 
Thr Val Glu Clu Val Asn He Pro Trp His Leu His Ala Phe His 

320 

1323 ATC TGG AAC CTG ATC OCC AOG GAC GCT GCT GCC TAC CAG ATG TTG 1367 
He Trp Asn Val He Ala Thr Asp CTy Gly Ala Tyr Gin Met Leu 
330 340 
1368 GAC GGC AAC GGA TAC GGC ATG AAC GCC GAA OCT TTG TAG GAT CCG 1412 
Asp CTy Asn Gly Tyr CTy Met Asn Ala Glu CTy Leu Tyr Asp Pro 

350 

CTu Val Thr Ala His Phe Ala Ser Arg Arg Leu Gin His Ala Asp 
360 370 
1458 OCT CTG TCC GAA ACC CTC AAA CTG CTG GCT CTG AOC GGC CAC CAC 1502 
Ala Leu Ser Glu Thr Val Lys Leu Val Ala Leu Thr CTy His His 

380 

1503 OCX: ATC ACC ACC CTC GGC GOC CCG AOC TAC OOC AAA GCC COG AAC 1547 
Cly He Thr Thr Leu Cly Gly Ala Ser Tyr CTy Lys Ala Arg Asn 
390 400 
1548 CTC CTA CCG CTC GCC OOC OOC GCC TAC GAC ACT GGC TTG AGA CAA 1592 
Leu Val Pro Leu Ala Arg Ala Ala Tyr Asp Thr Ala Leu Arg Gin 

410 

1593 TTC GAC CTC CTG CTC ATC COC ACA CTC OCX TAC CTC CCA TCC CAA 1637 
Phe Asp Val Leu Val Met Pro Thr Leu Pro Tyr Val Ala Ser Glu 
420 430 
1638 CTA CGG OCC AAC GAC CTC GAT CCT GCA ACC TTC ATC ACC AAC GCT 1682 
Leu Pro Ala Lys Asp Val Asp Arg Ala Thr Phe He Thr Lys Ala 

440 

1683 CTC CGG ATC ATC CCC AAC AOV GCA GIG TTC CAC CTC ACC GGA CAT 1727 
Leu Cly Met He Ala Asn Thr ATa Pro Phe Asp Val Thr Cly His 
450 460 
1728 OCC TOC CTG TCC CTT OCT GGC GOC CTC CTC AAC GOC CTT CCC CTC 1772 
Pro Ser Leu Ser Val Pro Ala Gly Leu Val Asn CTy Leu Pro Val 

470 
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1773 C3GA ATC ATG ATC ACC OQC AAC AOC TTC GAC CAT COG ACC CTC CTC 1817 
Cly Met Met lie Thr Cly Lys Thr Phe Asp Asp Ala Thr Val Leu 
480 490 
1818 GGG CTC COG CGC CCA TTC GAA AAC CTT CGC OOC OOC TTT CCC ACC 1862 
Arq Val Cly Arg Ala Phe Glu Lys Leu Arp Cly Ala Phe Pro Thr 

500 

1863 GCT CGC GAC CAC ATC TCC GAC TCT GCA CCA CAA CTC AGC CTC ACC 1907 

Pro Ala Asp His He Ser Asp Ser Ala Pro Gin Leu Ser Leu Thr 
510 520 
1908 TAG TTCTCTATCC GCACTPQCAC AACAAATTCC ACCGATTCAC ACATGATCAG C 1961 

*** 

1962 GCGCATAGA AAAOCTGAAC CAC ATC TCA GTA ACG ATC GAC CAC ACA ACC 2010 

Met Ser Val Thr He Asp His Thr Thr 
2011 GAC AAC GCC GCA CCG GCC CAC GGG CCC CTC TCC GAT CGC GCG TGG 2055 
Glu Asn Ala Ala Pro Ala Gin Cly Pro Val Ser Asp Arg Ala Trp 
10 20 
2056 GCC CTC TTC CGC GCA CTC CAC GCT AAG GGA TTG CTA CCC GAC OCT 2100 
Ala Leu Phe Arg Ala Leu Asp Gly Lys Cly Leu Val Pro Asp Cly 
30 

2101 TAC CTC GAG GGA TGG AAC AAC ACC TTC GAC GAC GAC TTC ACT CCA 2145 
Tyr Val Glu Gly Trp Lys Lys Thr Phe Glu Glu Asp Phe Ser Pro 
40 50 
2146 ACG COC GGA GCG GAA TTG CTC GCG CGC OCT TGG AGC GAC CCC GAT 2190 
Arq Arq Gly Ala Glu Leu Val Ala Arq Ala Trp Thr Asp Pro Asp 
60 

2191 TTC COG CAA CTC CTT CTC AGC CAC GGT ACC OOC GOG CTT CGC CAC 2235 
Phe Arq Gin Leu Leu Leu Thr Asp Gly Thr Ala Ala Val Ala Gin 
70 80 
2236 TAC GGA TAT CTT GOC CCC CAC CGC CAA TAC ATC CTC GCA CTC CAA 2280 
Tyr Cly Tyr Leu Gly Pro Gin Gly Glu Tyr He Val Ala Val Glu 
90 

2281 GAC AOC COG ACC CTC AAG AAC CTC ATC OTC TOC TOG CTC TGT TCA 2325 

Asp Thr Pro Thr Leu Lys Asn Val He Val Cys Ser Leu Cys Ser 

100 1100 
2326 TOC AGC GCC TOG CCC ATT CTC OOC CTC CCC GCT AGC TGG TAC AAC 2370 

Cys Thr Ala Trp Pro He Leu Cly Leu Pro Pro Thr Trp Tyr Lys 
120 

2371 ACT TTC GAA TAC CCT QCC CGA CTC CTG CGT GAG CCA COG AAG CTT 2415 

Ser Peh Glu Tyr Arq Ala Arq Val Val Arq Clu Pro Arq Lys Val 

130 140 
2416 CTC TOC GAC ATG GGA ACC CAC ATC GCG TCG GAC CTC GAG ATC CGC 2460 

Leu Ser Clu Met Gly Thr Glu He Ala Ser Asp Val Glu He Arq 
150 

2461 CTC TAC CAC ACC ACC GCC CAA ACT CGC TAC ATG CTT CTC CCC CAA 2505 

Val Tyr Asp Thr Thr Ala Glu Thr Arq Tyr Met Val Leu Pro Gin 

160 170 
2506 GGTCOCOCAGOCACCGAAGOCTCCAGCCACCAACACCTTCAACAG 2550 

Arq Pro Ala Gly Thr Clu Gly Trp Ser Gin Clu Gin Leu Gin Glu 
180 

2551 ATC CTC ACC AAC GAC TOC CTC ATC OGC CTC OCA CTC CCG CAC CTC 2595 
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23 24 
He Val Thr Lys Asp Cys Leu lie Cly Val Ala Val Pro Gin Val 
190 200 
2596 GCC ACC CTC TGA TCACCXZCGACA AGAMCAACC ACACC ATC GAT OCA 2642 

Pro Thr Val *** Met Asp Cly 

2643 CTA CAC GAT or GCC OGA CTT CAA GCC TTC OGC AAA CTC CCG CAT 2687 
Val His Asp Leu Ala Gly Val Gin Gly Phe Gly Lys Val Pro His 
10 

2688 ACC cue AAC GCC GAC ATC GOC CCC ACC TTC CAC COC GAG TGG GAA 2732 
Thr Val Asn Ala Asp He Gly Pro Thr Phe Kis Ala Glu Trp Glu 
20 30 
2733 CAC CTG CCG TAC AGC CTG ATG TTC GCC GGT CTC GOC GAA CTC GGG 2777 
Kis Leu Pro Tyr Ser Leu Met Phe Ala Gly Val Ala Glu Leu Gly 
40 

2778 OCA TTC AGC CrC GAC G^ CTT CGA TAC CTC CTC GAG CGG ATG GAA 2822 
Ala Phe Ser Val Asp Glu Val Arg Tyr Val Val Glu Arg Met Glu 
50 60 
2823 CXIA COC CAC TAC ATG ATG AOC CCG TAC TAC GAG AOC TAC CTC ATC 2867 
Pro Arg His Tyr Met Met Thr Pro Tyr Tyr Glu Arg Tyr Val He 
70 

2868 OGC CTC CCG ACC CTG ATG CTC GAA AAG OGA ATC CTG ACC CAC GAA 2912 
Gly Val Ala Thr Leu Met VaT Glu Lys Gly He Leu Thr Gin Glu 
80 90 
2913 GAA one GAA ACC err CCA GOC GGA CCG TTC OCA CTC TCG CGG CCC 2957 
Clu Leu Glu Ser Leu Ala Gly Gly Pro Phe Pro Leu Ser Arg Pro 
100 

2958 AOC GAA TCC GAA COG OCT COG OCA CCC ore GAG ACG ACC ACC TTC 3002 
Ser Glu Ser GTu Cly Arg Pro Ala Pro Val Clu Thr Thr Thr Phe 
HO 120 
3003 CAA ATC OCr CAC CCr CTA COC CrO COC GAC GAG TAC GTT CCG GGG 3047 
Clu lie Gly GTn Arg Val Arg Val Arg Asp Clu Tyr Val Pro Gly 
130 

3048 CAT ATT CGA ATG CO" OCC TAC TOC COC OGA GGA CTC OGA ACC ATC 3092 
His He Arg Met Pro Ala Tyr Cys Arg Gly Arg Val Gly Thr He 
140 150 
3093 TCT CAT COG ACT AGC Q\C AAC TGG CCG TTT OCC CAC OCA ATT GOC 3137 
Ser His Arg Thr Ser Glu Lys Trp Pro Phe Pro Asp Ala He Gly 
160 

3138 CAC GOG COC AAC CAC OCC COC GAA GAA CCG ACC TAC CAC CrC AAG 3182 
His Gly Arg Asn Asp Ala Gly Glu Glu Pro Thr Tyr His VaT Lys 
170 180 
3183 TTC GOC OCC GAG GAA TTG TTC OCT AGC GAC ACC GAC GOC GOC AOC 3227 
Fhe Ala Ala Glu Glu Leu Phe Cly Ser Asp Thr Asp Gly Gly Ser 
190 

3228 CTC CTA CTC GAC CTC TTC GAC OCT TAC CTC GAC OCT OCG GCC TGA 3272 
Val Val VaT Asp Leu Phe Glu Gly Tyr Leu CTu Pro Ala Ala 
200 210 
3273 TCCTCCAACA TTCCOGGCOG CGGTCACOOC ATCACAOOCT TTCGCGICAC GGCCG 3327 
3328 OCTGA TCACAOCAAT TCTCTCATTC OGAAGGACAC TOGAAATC ATC CTC CAC 3379 

Met Val Asp 

3380 ACA CGA CTT CCG CTC ACC CTC CTC TCA OCT TTC CTC GOC GCC GGG 3424 



(14) 

25 26 
Thr Arq Leu Pro Val TTir Val Leu Ser Gly Phe Leu Gly Ala Gly 
10 

3425 AAG ACG ACA CTA CTC AAC GAG ATC CTC CGA AAT COG GAG GGC CGC 3469 
Lys Thr Thr Leu Leu Asn Clu He Leu Arq Asn Arq Glu Gly Arq 
20 30 
3470 GGG GTT GCG GTG ATC CTC AAC GAC ATG AGC GAA ATC AAC ATC GAC 3514 
Arq Val Ala Val He Val Asn Asp Met Ser au lie Asn lie Asp 
40 

3515 ACT GCA GAA CTC GAG CCT GAC ATC TCG CTC ACT CGC TCC GAG GAG 3559 
Ser Ala Glu Val Glu Arq Glu He Ser Leu Ser Arq Ser Glu Glu 
50 60 
3560 AAA CTC CTC GAG ATG ACC AAC GGC TCC ATC TGC TOC ACT CTG CGA 3604 
Lys Leu Val Glu Met Thr Asn Gly Cys lie Cys Cys Thr Leu Arq 
70 

3605 GAC GAT or CTC TCC GAG ATC ACT CCC TTG OCC GGC GAT GGC CGA 3649 
Glu Asp Leu Leu Ser Glu He Ser Ala Leu Ala Ala Asp Gly Arq 
80 90 
3650 TTC GAC TAC CTA CTC ATC GAA TCT TCG GGC ATC JOZ GAA CCG CTT 3694 
Phe Asp Tyr Leu Leu He Glu Ser Ser Gly He Ser Glu Pro Leu 
100 

3695 OCC CTC CCA GAG ACG TTC ACA TTC ATC GAT ACC GAC GGC CAC GCC 3739 
Pro Val Ala Glu Thr Phe Thr Phe He Asp Thr Asp Gly His Ala 
HO 120 
3740 CTC GGC GAC CTC GCC CGA CTC GAC ACC ATG CTC AGC GTC GTC GAC 3784 
Leu Ala Asp Val Ala Arq Leu Asp Thr Met Val Thr Val Val Asp 
130 

3785 GGC CAC ACT TTT CTC GGC GAC TAC ACG GCT GGG GGC CGC CTC GAA 3829 
Cly His Ser Phe Leu Arq Asp Tyr Thr Ala Gly Gly Arq Val Glu 
140 150 
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3830 GCC GAT 
Ala Asp 

3875 GAT CfiC 
Asp Gin 
170 

3920 CTC CTC 
Leu Val 

3965 TCT TTC 
Ser Leu 
200 

4010 ATC CCA 
He Pro 

4055 AAG GCT 
Lys Ala 
230 

4100 CAC ATC 
His He 



GCC CCG GAA 
Ala Pro Glu 

ATC GAA TTT 
He Glu Phe 

TCG CAC CAC 
Ser His Gin 

AAC GCA TCC 
Asn Ala Ser 

CTC GAC ACG 
Leu Asp Thr 

GCA GAG GCC 
Ala Gin Ala 

COC GAA ACC 
Pro Clu Thr 



GVC GAA 
Asp Glu 
160 
OCC GAC 
Ala Asp 

CAC CTG 
His Leu 
190 
OCT GOG 
Ala Ala 

ATT CTC 
He Lai 
220 
GCC GGA 
Pro Cly 

CMC GAC 
Clu Clu 



CGA GAC ATC OCC GAT CTG CTT CTC 3847 
Arq Asp He Ala Asp Leu Leu Val 

GTC ATC CTC CTC AGC AAC OCC GAT 3919 
Val He Leu Val Ser Lys Ala Asp 
180 

GTC GAA TTG ACC GCA GTC CTG CGC 3964 
Val Glu Leu Thr Ala Val Leu Arq 

ATA GTT CCG ATG AGC CTC GCT CGC 4009 
He Val Pro Met Thr Leu Cly Arq 
210 

GAC ACC GCT TTG trc TCA CTC GAA 4054 
Asp Thr Gly Leu Phe Ser Leu Glu 

TCC TTA CAA GAA CTC CAA GCT GAA 4099 
Trp Leu Gin C\u Leu Cln Cly Clu 
240 

TAC CGA ATC OCT TOG CTC CTC TAC 4144 
Tyr Cly He Cly Ser Val Val Tyr 
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27 28 
250 

4145 OX GAC COC CCA CCC TTC CAC CCC CAA CGG CTG CAT GAT TTC CTC 4189 
Arq Glu Arg Ala Pro Fhe His Pro Gin Arq Leu His Asp Phe Leu 
260 270 
4190 ACC AGC GAG TGG ACC AAC CGA AAC TTA err OGG GOC AAC GQC TAC 4234 
Ser Ser Glu Trp Thr Asn Gly Lys Leu Leu Arg Ala Lys Gly Tyr 
280 

4235 TAC TGG AAT GCC GGC CGG TTC ACC GAG ATC GGG ACT ATT TCT CAG 4279 
Tyr Trp Asn Ala Gly Arg Phe Thr Glu He Gly Ser lie Ser Gin 
290 300 
4280 GCC GCT CAT CTC ATT CGC CAC GGA TAC CTC OGC CCT TGG TGG AAC 4324 
Ala Gly His Leu He Arg His Gly Tyr Val Gly Arg Trp Trp Asn 
310 

4325 TTT OA CCC CCT GAC GVG TOG CCG GCC GAG GAT TAC CCT CCT GAC 4369 
Phe Leu Pro Arg Asp CTu Trp Pro Ala Asp Asp Tyr Arg Arg Asp . 
320 330 
4370 GGA ATC CTC GAC AAG TOG CAA CAA CCC CTC GGA GAC TGC CGA CAA 4414 
CTy He Leu Asp Lys Trp Glu Glu Pro Val Gly Asp Cys Arg Gin 
340 

4415 GAA CTC CTC TTC ATC GOC CAA GCC ATC GAC GCG TCT CGA CTC CAC 4459 
CTu Leu Val Phe lie CTy Cln Ala He Asp Pro Ser Arg Leu His 
350 360 
4460 OGA GAA CTC GAC GCG TCT CTA CTC ACC ACA GCC ATC GAA CTC 4504 
Arg Glu Leu Asp Ala Cys Leu Leu Thr Thr Ala Glu He CTu Leu 
370 

4505 GOG CCA CAC CTG TGG ACC ACT TGG AGC GAC GCC CTC GGC CTC GGC 4549 

CTy Pro Asp Val Trp Thr Thr Trp Ser Asp Pro Leu Gly Val Gly 
380 390 
4550 TAT ACC GAC CAC ACC CTT TGA TATTCCTCCCC TTTCCCCTAA CTTCTCCOC 4600 

Tyr Thr Asp Gin Thr Val *** 
4601 A ATCQCGCTAT CAOGCAACTG AACTCGOGCG TTCTTCATCC TCCCGOCGAT CGC 4654 
4655 ACCCGOC ACCCTOQCTG CAOCTAGQCC GQCGGGOGCC ACCTGGACAC TCOOXCG 4709 



4710 AT CTGCGCCGAC TGQCGGOTCG AAACCCTAGC GCAAOXJITC GAACAOCTOG AA 4763 
4764 AACCTCXSA GGCC 4775 
[01] miktT^^-iimBl^<DPCR^U-j:^>^ *^9kb<DEcoRI DN A»r>t^W-r fc<D. *JJ: 

(Dfc^t^mA,fcr ^ ymwmtnm^hx^i^A^iKm c/ffl^^*:/^^^ ^kpcnh-s EosnKKSiaa* 

[132] la2«:7'^-:^/^'^::^U^^'tf>->'3>rfit4 
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AMN- 1 



AMR - 2 



Tbr/*la T*l Al« Qlf Ul Pro let let 
5'-CA 6Tfi CCC COi CTG CCA ATC ATC-S' 
C C C T C 

Trp Asa Tal llfr Ala Tbr Glu/Asp Gly 
S'- CC TTG CAC TAC CC6 TCC CTA CCA-S* 
C C 
6 
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